20. Edge Cases

ND0063 C1 L4 18 Edge Case- [#1] Video

Many applications and services lend themselves to being monitored and maintained. When you run into an application that does not, it is no less important (it's like more important) to monitor, alert and maintain these applications. You may find yourself needing to go to extremes in order to pull these systems into your monitoring framework, but if you do not, you are putting yourself at risk for letting faults go undetected. Ensuring coverage of all of the components of your platform, documenting and training staff to understand the platform and practicing what to do in the case of outages will help ensure the highest uptime for your company.

Monitoring challenges

What should you do if there are not straightforward ways to monitor an important system component?

SOLUTION:
  • Find a more complex way
  • Alter the system to make it more easily monitorable

Crucial items

Which are crucial items to allow for systems to be recovered in a timely manner?

SOLUTION:
  • Documentation
  • People empowered to decide when systems require recovery
  • Configuration in place to allow for recovery